SEQUENCE LISTING

(1) GENERAL INFORMATION 
(i) APPLICANT: SmithKline Beecham Biologicals S.A. 
(ii) TITLE OF THE INVENTION: Vaccine 



(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham 

(B) STREET: Two New Horizons Court 

(C) CITY: Brentford 

(D) STATE: 

(E) COUNTRY: Middx, UK 

(F) ZIP: TW8 9EP 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 26-SEP-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Bor, Fiona R 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER:

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 0181 975 2817 

(B) TELEFAX: 0181 975 6141 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
ATCGTCCATG.GGT.GGC.AAG.TGG.T
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
CGGCTACTAG TGCAGTTCTT GAA 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATCGTACTAGT.GAG.CCA.GTA.GAT.C
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CGGCTACTAG TTTCCTTCGG GCCT 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
ATCGTCCATG GAGCCAGTAG ATC
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATGGATCCAA AAACTTTAGC CCTTTCTTTA TTAGCAGCTG GCGTACTAGC AGGTTGTAGC 60 

AGCCATTCAT CAAATATGGC GAATACCCAA ATGAAATCAG ACAAAATCAT TATTGCTCAC 120

CGTGGTGCTA GCGGTTATTT ACCAGAGCAT ACGTTAGAAT CTAAAGCACT TGCTTTTGCA 180

CAACAGGCTG ATTATTTAGA GCAAGATTTA GCAATGACTA AGGATGGTCG TTTAGTGGTT 240

ATTCACGATC ACTTTTTAGA TGGCTTGACT GATGTTGCGA AAAAATTCCC ACATCGTCAT 300

CGTAAAGATG GCCGTTACTA TGTCATCGAC TTTACCTTAA AAGAAATTCA AAGTTTAGAA 360

ATGACAGAAA ACTTTGAAAC CATGGCCACG TGTGATCAGA GCTCAACTAG TGGCCACCAT 420

CACCATCACC ATTAATCTAG A 441 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 144 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Asp Pro Lys Thr Leu Ala Leu Ser Leu Leu Ala Ala Gly Val Leu

1 5 10 15

Ala Gly Cys Ser Ser His Ser Ser Asn Met Ala Asn Thr Gln Met Lys

20 25 30

Ser Asp Lys Ile Ile Ile Ala His Arg Gly Ala Ser Gly Tyr Leu Pro

35 40 45

Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gln Gln Ala Asp

50 55 60

Tyr Leu Glu Gln Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val
65 70 75 80

Ile His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe

85 90 95

Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val Ile Asp Phe Thr

100 105 110

Leu Lys Glu Ile Gln Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met

115 120 125

Ala Thr Cys Asp Gln Ser Ser Thr Ser Gly His His His His His His

130 135 140



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG GTTGGATGGC CTACTGTAAG GGAAAGAATG 60

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGCAT CTCGAGACCT GGAAAAACAT 120

GGAGCAATCA CAAGTAGCAA TACAGCAGCT ACCAATGCTG CTTGTGCCTG GCTAGAAGCA 180

CAAGAGGAGG AGGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300

ATTCACTCCC AACGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 480

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 540

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600

GAGTACTTCA AGAACTGCAC TAGTGGCCAC CATCACCATC ACCATTAA 648

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 216 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Gly Gly Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val

1 5 10 15

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala

20 25 30

Ala Ser Arg Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr

35 40 45

Ala Ala Thr Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu

50 55 60

Glu Val Gly Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr
65 70 75 80

Tyr Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly

85 90 95

Leu Glu Gly Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu

100 105 110

Trp Ile Tyr His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr

115 120 125

Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys

130 135 140

Leu Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu
145 150 155 160

Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro

165 170 175

Glu Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His

180 185 190

His Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys Thr Ser

195 200 205

Gly His His His His His His
210 215

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 288 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC CAGGAAGTCA GCCTAAAACT 60 

GCTTGTACCA ATTGCTATTG TAAAAAGTGT TGCTTTCATT GCCAAGTTTG TTTCATAACA 120 

AAAGCCTTAG GCATCTCCTA TGGCAGGAAG AAGCGGAGAC AGCGACGAAG ACCTCCTCAA 180 

GGCAGTCAGA CTCATCAAGT TTCTCTATCA AAGCAACCCA CCTCCCAATC CCGAGGGGAC 240

CCGACAGGCC CGAAGGAAAC TAGTGGCCAC CATCACCATC ACCATTAA 288 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 96 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser

1 5 10 15

Gln Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe

20 25 30

His Cys Gln Val Cys Phe Ile Thr Lys Ala Leu Gly Ile Ser Tyr Gly

35 40 45

Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln Gly Ser Gln Thr

50 55 60

His Gln Val Ser Leu Ser Lys Gln Pro Thr Ser Gln Ser Arg Gly Asp
65 70 75 80

Pro Thr Gly Pro Lys Glu Thr Ser Gly His His His His His His

85 90 95





(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG GTTGGATGGC CTACTGTAAG GGAAAGAATG 60 

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGCAT CTCGAGACCT GGAAAAACAT 120 

GGAGCAATCA CAAGTAGCAA TACAGCAGCT ACCAATGCTG CTTGTGCCTG GCTAGAAGCA 180

CAAGAGGAGG AGGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300

ATTCACTCCC AACGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360



SUBSTITUTE SHEET (RULE 26) 



WO 99/16884 PCT/EP98/06040 

6 / 15 

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 480

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 540

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600

GAGTACTTCA AGAACTGCAC TAGTGAGCCA GTAGATCCTA GACTAGAGCC CTGGAAGCAT 660

CCAGGAAGTC AGCCTAAAAC TGCTTGTACC AATTGCTATT GTAAAAAGTG TTGCTTTCAT 720 

TGCCAAGTTT GTTTCATAAC AAAAGCCTTA GGCATCTCCT ATGGCAGGAA GAAGCGGAGA 780 

CAGCGACGAA GACCTCCTCA AGGCAGTCAG ACTCATCAAG TTTCTCTATC AAAGCAACCC 840 

ACCTCCCAAT CCCGAGGGGA CCCGACAGGC CCGAAGGAAA CTAGTGGCCA CCATCACCAT 900 
CACCATTAA 909

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 303 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Met Gly Gly Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val

1 5 10 15

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala

20 25 30

Ala Ser Arg Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr

35 40 45

Ala Ala Thr Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu

50 55 60

Glu Val Gly Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr
65 70 75 80

Tyr Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly

85 90 95

Leu Glu Gly Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu

100 105 110

Trp Ile Tyr His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr

115 120 125

Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys

130 135 140

Leu Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu
145 150 155 160

Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro

165 170 175

Glu Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His

180 185 190

His Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys Thr Ser

195 200 205

Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gln

210 215 220

Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe His
225 230 235 240

Cys Gln Val Cys Phe Ile Thr Lys Ala Leu Gly Ile Ser Tyr Gly Arg

245 250 255

Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln Gly Ser Gln Thr His

260 265 270

Gln Val Ser Leu Ser Lys Gln Pro Thr Ser Gln Ser Arg Gly Asp Pro

275 280 285

Thr Gly Pro Lys Glu Thr Ser Gly His His His His His His
290 295 300

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1029 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATGGATCCAA AAACTTTAGC CCTTTCTTTA TTAGCAGCTG GCGTACTAGC AGGTTGTAGC 60 

AGCCATTCAT CAAATATGGC GAATACCCAA ATGAAATCAG ACAAAATCAT TATTGCTCAC 120 

CGTGGTGCTA GCGGTTATTT ACCAGAGCAT ACGTTAGAAT CTAAAGCACT TGCTTTTGCA 180 

CAACAGGCTG ATTATTTAGA GCAAGATTTA GCAATGACTA AGGATGGTCG TTTAGTGGTT 240

ATTCACGATC ACTTTTTAGA TGGCTTGACT GATGTTGCGA AAAAATTCCC ACATCGTCAT 300 

CGTAAAGATG GCCGTTACTA TGTCATCGAC TTTACCTTAA AAGAAATTCA AAGTTTAGAA 360 

ATGACAGAAA ACTTTGAAAC CATGGGTGGC AAGTGGTCAA AAAGTAGTGT GGTTGGATGG 420 

CCTACTGTAA GGGAAAGAAT GAGACGAGCT GAGCCAGCAG CAGATGGGGT GGGAGCAGCA 480 

TCTCGAGACC TGGAAAAACA TGGAGCAATC ACAAGTAGCA ATACAGCAGC TACCAATGCT 540

GCTTGTGCCT GGCTAGAAGC ACAAGAGGAG GAGGAGGTGG GTTTTCCAGT CACACCTCAG 600 

GTACCTTTAA GACCAATGAC TTACAAGGCA GCTGTAGATC TTAGCCACTT TTTAAAAGAA 660

AAGGGGGGAC TGGAAGGGCT AATTCACTCC CAACGAAGAC AAGATATCCT TGATCTGTGG 720 

ATCTACCACA CACAAGGCTA CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAGGGGTC 780

AGATATCCAC TGACCTTTGG ATGGTGCTAC AAGCTAGTAC CAGTTGAGCC AGATAAGGTA 840 

GAAGAGGCCA ATAAAGGAGA GAACACCAGC TTGTTACACC CTGTGAGCCT GCATGGAATG 900 

GATGACCCTG AGAGAGAAGT GTTAGAGTGG AGGTTTGACA GCCGCCTAGC ATTTCATCAC 960

GTGGCCCGAG AGCTGCATCC GGAGTACTTC AAGAACTGCA CTAGTGGCCA CCATCACCAT 1020 
CACCATTAA 1029

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Cys Ser Ser His Ser Ser Asn Met Ala Asn Thr Gln Met Lys Ser Asp

1 5 10 15

Lys Ile Ile Ile Ala His Arg Gly Ala Ser Gly Tyr Leu Pro Glu His

20 25 30

Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gln Gln Ala Asp Tyr Leu

35 40 45

Glu Gln Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val Ile His

50 55 60

Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe Pro His
65 70 75 80

Arg His Arg Lys Asp Gly Arg Tyr Tyr Val Ile Asp Phe Thr Leu Lys

85 90 95

Glu Ile Gln Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met Gly Gly

100 105 110

Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val Arg Glu Arg

115 120 125

Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg

130 135 140

Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr Ala Ala Thr
145 150 155 160

Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu Glu Val Gly

165 170 175

Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr Tyr Lys Ala

180 185 190

Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly

195 200 205

Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu Trp Ile Tyr

210 215 220

His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr Pro Gly Pro
225 230 235 240

Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys Leu Val Pro

245 250 255

Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu Asn Thr Ser

260 265 270

Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro Glu Arg Glu

275 280 285

Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His His Val Ala

290 295 300

Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys Thr Ser Gly His His
305 310 315 320

His His His His



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATGGATCCAA AAACTTTAGC CCTTTCTTTA TTAGCAGCTG GCGTACTAGC AGGTTGTAGC 60 

AGCCATTCAT CAAATATGGC GAATACCCAA ATGAAATCAG ACAAAATCAT TATTGCTCAC 120 

CGTGGTGCTA GCGGTTATTT ACCAGAGCAT ACGTTAGAAT CTAAAGCACT TGCGTTTGCA 180 

CAACAGGCTG ATTATTTAGA GCAAGATTTA GCAATGACTA AGGATGGTCG TTTAGTGGTT 240

ATTCACGATC ACTTTTTAGA TGGCTTGACT GATGTTGCGA AAAAATTCCC ACATCGTCAT 300 

CGTAAAGATG GCCGTTACTA TGTCATCGAC TTTACCTTAA AAGAAATTCA AAGTTTAGAA 360 

ATGACAGAAA ACTTTGAAAC CATGGGTGGC AAGTGGTCAA AAAGTAGTGT GGTTGGATGG 420 

CCTACTGTAA GGGAAAGAAT GAGACGAGCT GAGCCAGCAG CAGATGGGGT GGGAGCAGCA 480 

TCTCGAGACC TGGAAAAACA TGGAGCAATC ACAAGTAGCA ATACAGCAGC TACCAATGCT 540 

GCTTGTGCCT GGCTAGAAGC ACAAGAGGAG GAGGAGGTGG GTTTTCCAGT CACACCTCAG 600 

GTACCTTTAA GACCAATGAC TTACAAGGCA GCTGTAGATC TTAGCCACTT TTTAAAAGAA 660 

AAGGGGGGAC TGGAAGGGCT AATTCACTCC CAACGAAGAC AAGATATCCT TGATCTGTGG 720 

ATCTACCACA CACAAGGCTA CTTCCCTGAT TGGCAGAACT ACACACCAGG GCCAGGGGTC 780 

AGATATCCAC TGACCTTTGG ATGGTGCTAC AAGCTAGTAC CAGTTGAGCC AGATAAGGTA 840

GAAGAGGCCA ATAAAGGAGA GAACACCAGC TTGTTACACC CTGTGAGCCT GCATGGAATG 900

GATGACCCTG AGAGAGAAGT GTTAGAGTGG AGGTTTGACA GCCGCCTAGC ATTTCATCAC 960

GTGGCCCGAG AGCTGCATCC GGAGTACTTC AAGAACTGCA CTAGTGAGCC AGTAGATCCT 1020 

AGACTAGAGC CCTGGAAGCA TCCAGGAAGT CAGCCTAAAA CTGCTTGTAC CAATTGCTAT 1080 

TGTAAAAAGT GTTGCTTTCA TTGCCAAGTT TGTTTCATAA CAAAAGCCTT AGGCATCTCC 1140

TATGGCAGGA AGAAGCGGAG ACAGCGACGA AGACCTCCTC AAGGCAGTCA GACTCATCAA 1200

GTTTCTCTAT CAAAGCAACC CACCTCCCAA TCCCGAGGGG ACCCGACAGG CCCGAAGGAA 1260

ACTAGTGGCC ACCATCACCA TCACCATTAA 1290 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Cys Ser Ser His Ser Ser Asn Met Ala Asn Thr Gln Met Lys Ser Asp

1 5 10 15

Lys Ile Ile Ile Ala His Arg Gly Ala Ser Gly Tyr Leu Pro Glu His

20 25 30

Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gln Gln Ala Asp Tyr Leu

35 40 45

Glu Gln Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val Ile His

50 55 60

Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe Pro His
65 70 75 80

Arg His Arg Lys Asp Gly Arg Tyr Tyr Val Ile Asp Phe Thr Leu Lys

85 90 95

Glu Ile Gln Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met Gly Gly

100 105 110

Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val Arg Glu Arg

115 120 125

Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala Ala Ser Arg

130 135 140

Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr Ala Ala Thr
145 150 155 160

Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu Glu Val Gly

165 170 175

Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr Tyr Lys Ala

180 185 190

Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly

195 200 205

Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu Trp Ile Tyr

210 215 220

His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr Pro Gly Pro
225 230 235 240

Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys Leu Val Pro

245 250 255

Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu Asn Thr Ser

260 265 270

Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro Glu Arg Glu

275 280 285

Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His His Val Ala

290 295 300

Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys Thr Ser Glu Pro Val
305 310 315 320

Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gln Pro Lys Thr

325 330 335

Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe His Cys Gln Val

340 345 350

Cys Phe Ile Thr Lys Ala Leu Gly Ile Ser Tyr Gly Arg Lys Lys Arg

355 360 365

Arg Gln Arg Arg Arg Pro Pro Gln Gly Ser Gln Thr His Gln Val Ser

370 375 380

Leu Ser Lys Gln Pro Thr Ser Gln Ser Arg Gly Asp Pro Thr Gly Pro
385 390 395 400

Lys Glu Thr Ser Gly His His His His His His

405 410



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 60 

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 120 

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 180 

CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 240 

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 300 

CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGGTG GCAAGTGGTC AAAAAGTAGT 360

GTGGTTGGAT GGCCTACTGT AAGGGAAAGA ATGAGACGAG CTGAGCCAGC AGCAGATGGG 420

GTGGGAGCAG CATCTCGAGA CCTGGAAAAA CATGGAGCAA TCACAAGTAG CAATACAGCA 480

GCTACCAATG CTGCTTGTGC CTGGCTAGAA GCACAAGAGG AGGAGGAGGT GGGTTTTCCA 540

GTCACACCTC AGGTACCTTT AAGACCAATG ACTTACAAGG CAGCTGTAGA TCTTAGCCAC 600

TTTTTAAAAG AAAAGGGGGG ACTGGAAGGG CTAATTCACT CCCAACGAAG ACAAGATATC 660

CTTGATCTGT GGATCTACCA CACACAAGGC TACTTCCCTG ATTGGCAGAA CTACACACCA 720

GGGCCAGGGG TCAGATATCC ACTGACCTTT GGATGGTGCT ACAAGCTAGT ACCAGTTGAG 780

CCAGATAAGG TAGAAGAGGC CAATAAAGGA GAGAACACCA GCTTGTTACA CCCTGTGAGC 840 

CTGCATGGAA TGGATGACCC TGAGAGAGAA GTGTTAGAGT GGAGGTTTGA CAGCCGCCTA 900 

GCATTTCATC ACGTGGCCCG AGAGCTGCAT CCGGAGTACT TCAAGAACTG CACTAGTGGC 960 
CACCATCACC ATCACCATTA A

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gln Met Lys

1 5 10 15

Ser Asp Lys Ile Ile Ile Ala His Arg Gly Ala Ser Gly Tyr Leu Pro

20 25 30

Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gln Gln Ala Asp

35 40 45

Tyr Leu Glu Gln Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val

50 55 60

Ile His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe
65 70 75 80

Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val Ile Asp Phe Thr

85 90 95

Leu Lys Glu Ile Gln Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met

100 105 110

Gly Gly Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val Arg

115 120 125

Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala Ala

130 135 140

Ser Arg Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr Ala
145 150 155 160

Ala Thr Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu Glu

165 170 175

Val Gly Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr Tyr

180 185 190

Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu

195 200 205

Glu Gly Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu Trp

210 215 220

Ile Tyr His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr Pro
225 230 235 240

Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys Leu

245 250 255

Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu Asn

260 265 270

Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro Glu

275 280 285

Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His His

290 295 300

Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys Thr Ser Gly
305 310 315 320

His His His His His His
325

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

ATGGATCCAA GCAGCCATTC ATCAAATATG GCGAATACCC AAATGAAATC AGACAAAATC 60

ATTATTGCTC ACCGTGGTGC TAGCGGTTAT TTACCAGAGC ATACGTTAGA ATCTAAAGCA 120

CTTGCGTTTG CACAACAGGC TGATTATTTA GAGCAAGATT TAGCAATGAC TAAGGATGGT 180

CGTTTAGTGG TTATTCACGA TCACTTTTTA GATGGCTTGA CTGATGTTGC GAAAAAATTC 240

CCACATCGTC ATCGTAAAGA TGGCCGTTAC TATGTCATCG ACTTTACCTT AAAAGAAATT 300




CAAAGTTTAG AAATGACAGA AAACTTTGAA ACCATGGGTG GCAAGTGGTC AAAAAGTAGT 360

GTGGTTGGAT GGCCTACTGT AAGGGAAAGA ATGAGACGAG CTGAGCCAGC AGCAGATGGG 420

GTGGGAGCAG CATCTCGAGA CCTGGAAAAA CATGGAGCAA TCACAAGTAG CAATACAGCA 480

GCTACCAATG CTGCTTGTGC CTGGCTAGAA GCACAAGAGG AGGAGGAGGT GGGTTTTCCA 540

GTCACACCTC AGGTACCTTT AAGACCAATG ACTTACAAGG CAGCTGTAGA TCTTAGCCAC 600

TTTTTAAAAG AAAAGGGGGG ACTGGAAGGG CTAATTCACT CCCAACGAAG ACAAGATATC 660

CTTGATCTGT GGATCTACCA CACACAAGGC TACTTCCCTG ATTGGCAGAA CTACACACCA 720 

GGGCCAGGGG TCAGATATCC ACTGACCTTT GGATGGTGCT ACAAGCTAGT ACCAGTTGAG 780 

CCAGATAAGG TAGAAGAGGC CAATAAAGGA GAGAACACCA GCTTGTTACA CCCTGTGAGC 840

CTGCATGGAA TGGATGACCC TGAGAGAGAA GTGTTAGAGT GGAGGTTTGA CAGCCGCCTA 900

GCATTTCATC ACGTGGCCCG AGAGCTGCAT CCGGAGTACT TCAAGAACTG CACTAGTGAG 960

CCAGTAGATC CTAGACTAGA GCCCTGGAAG CATCCAGGAA GTCAGCCTAA AACTGCTTGT 1020

ACCAATTGCT ATTGTAAAAA GTGTTGCTTT CATTGCCAAG TTTGTTTCAT AACAAAAGCC 1080 

TTAGGCATCT CCTATGGCAG GAAGAAGCGG AGACAGCGAC GAAGACCTCC TCAAGGCAGT 1140 

CAGACTCATC AAGTTTCTCT ATCAAAGCAA CCCACCTCCC AATCCCGAGG GGACCCGACA 1200 

GGCCCGAAGG AAACTAGTGG CCACCATCAC CATCACCATT AA 1242 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Asp Pro Ser Ser His Ser Ser Asn Met Ala Asn Thr Gln Met Lys

1 5 10 15

Ser Asp Lys Ile Ile Ile Ala His Arg Gly Ala Ser Gly Tyr Leu Pro

20 25 30

Glu His Thr Leu Glu Ser Lys Ala Leu Ala Phe Ala Gln Gln Ala Asp

35 40 45

Tyr Leu Glu Gln Asp Leu Ala Met Thr Lys Asp Gly Arg Leu Val Val

50 55 60

Ile His Asp His Phe Leu Asp Gly Leu Thr Asp Val Ala Lys Lys Phe
65 70 75 80

Pro His Arg His Arg Lys Asp Gly Arg Tyr Tyr Val Ile Asp Phe Thr

85 90 95

Leu Lys Glu Ile Gln Ser Leu Glu Met Thr Glu Asn Phe Glu Thr Met

100 105 110

Gly Gly Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val Arg

115 120 125

Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala Ala

130 135 140

Ser Arg Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr Ala
145 150 155 160

Ala Thr Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu Glu

165 170 175

Val Gly Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr Tyr

180 185 190

Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu

195 200 205

Glu Gly Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu Trp

210 215 220

Ile Tyr His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr Pro
225 230 235 240

Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys Leu

245 250 255

Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu Asn

260 265 270

Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro Glu

275 280 285

Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His His

290 295 300

Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys Thr Ser Glu
305 310 315 320

Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gln Pro

325 330 335

Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe His Cys

340 345 350

Gln Val Cys Phe Ile Thr Lys Ala Leu Gly Ile Ser Tyr Gly Arg Lys

355 360 365

Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln Gly Ser Gln Thr His Gln

370 375 380

Val Ser Leu Ser Lys Gln Pro Thr Ser Gln Ser Arg Gly Asp Pro Thr
385 390 395 400

Gly Pro Lys Glu Thr Ser Gly His His His His His His
405 410

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATGGAGCCAG TAGATCCTAG ACTAGAGCCC TGGAAGCATC CAGGAAGTCA GCCTAAAACT 
GCTTGTACCA ATTGCTATTG TAAAAAGTGT TGCTTTCATT GCCAAGTTTG TTTCATAACA 
GCTGCCTTAG GCATCTCCTA TGGCAGGAAG AAGCGGAGAC AGCGACGAAG ACCTCCTCAA 
GGCAGTCAGA CTCATCAAGT TTCTCTATCA AAGCAACCCA CCTCCCAATC CAAAGGGGAG 
CCGACAGGCC CGAAGGAAAC TAGTGGCCAC CATCACCATC ACCATTAA 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser

1 5 10 15

Gln Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe

20 25 30

His Cys Gln Val Cys Phe Ile Thr Ala Ala Leu Gly Ile Ser Tyr Gly

35 40 45

Arg Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln Gly Ser Gln Thr

50 55 60

His Gln Val Ser Leu Ser Lys Gln Pro Thr Ser Gln Ser Lys Gly Glu
65 70 75 80

Pro Thr Gly Pro Lys Glu Thr Ser Gly His His His His His His
85 90 95



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGTG GTTGGATGGC CTACTGTAAG GGAAAGAATG 60 

AGACGAGCTG AGCCAGCAGC AGATGGGGTG GGAGCAGCAT CTCGAGACCT GGAAAAACAT 120 

GGAGCAATCA CAAGTAGCAA TACAGCAGCT ACCAATGCTG CTTGTGCCTG GCTAGAAGCA 180 

CAAGAGGAGG AGGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AACGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360 

TTCCCTGATT GGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GATAAGGTAG AAGAGGCCAA TAAAGGAGAG 480

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA GAGAGAAGTG 540

TTAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600

GAGTACTTCA AGAACTGCAC TAGTGAGCCA GTAGATCCTA GACTAGAGCC CTGGAAGCAT 660

CCAGGAAGTC AGCCTAAAAC TGCTTGTACC AATTGCTATT GTAAAAAGTG TTGCTTTCAT 720

TGCCAAGTTT GTTTCATAAC AGCTGCCTTA GGCATCTCCT ATGGCAGGAA GAAGCGGAGA 780

CAGCGACGAA GACCTCCTCA AGGCAGTCAG ACTCATCAAG TTTCTCTATC AAAGCAACCC 840

ACCTCCCAAT CCAAAGGGGA GCCGACAGGC CCGAAGGAAA CTAGTGGCCA CCATCACCAT 900
CACCATTAA 909



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Gly Gly Lys Trp Ser Lys Ser Ser Val Val Gly Trp Pro Thr Val

1 5 10 15

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala

20 25 30

Ala Ser Arg Asp Leu Glu Lys His Gly Ala Ile Thr Ser Ser Asn Thr

35 40 45

Ala Ala Thr Asn Ala Ala Cys Ala Trp Leu Glu Ala Gln Glu Glu Glu

50 55 60

Glu Val Gly Phe Pro Val Thr Pro Gln Val Pro Leu Arg Pro Met Thr
65 70 75 80

Tyr Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly

85 90 95

Leu Glu Gly Leu Ile His Ser Gln Arg Arg Gln Asp Ile Leu Asp Leu
100 105 110

Trp Ile Tyr His Thr Gln Gly Tyr Phe Pro Asp Trp Gln Asn Tyr Thr

115 120 125

Pro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys

130 135 140

Leu Val Pro Val Glu Pro Asp Lys Val Glu Glu Ala Asn Lys Gly Glu
145 150 155 160

Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp Pro

165 170 175

Glu Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His

180 185 190

His Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys Thr Ser

195 200 205

Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser Gln

210 215 220

Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe His
225 230 235 240

Cys Gln Val Cys Phe Ile Thr Ala Ala Leu Gly Ile Ser Tyr Gly Arg

245 250 255

Lys Lys Arg Arg Gln Arg Arg Arg Pro Pro Gln Gly Ser Gln Thr His

260 265 270

Gln Val Ser Leu Ser Lys Gln Pro Thr Ser Gln Ser Lys Gly Glu Pro

275 280 285

Thr Gly Pro Lys Glu Thr Ser Gly His His His His His His
290 295 300

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TTCGAAACCA TGGCCGCGGA CTAGTGGCCA CCATCACCAT CACCATTAAC GGAATTC 57 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Thr Ser Gly His His His His His His 
1 5 






